Abstract: We consider possible reconstructions of a binary image of which the row and column
sums are given. For any reconstruction we can define the length of the boundary of the image. 
In this paper we prove a new lower bound on the length of this boundary. In contrast to simple 
bounds that have been derived previously, in this new lower bound the information of both row 
and column sums is combined. 



1 Introduction 

An important problem in discrete tomography is to reconstruct a binary image on a lattice 
from given projections in lattice directions [7, 8]. Each point of a binary image has a value
equal to zero or one. The line sum of a line through the image is the sum of the values of 
the points on this line. The projection of the image in a certain direction consists of all the 
line sums of the lines through the image in this direction. Any binary image with exactly 
the same projections as the original image we call a reconstruction of the image. 

For any set of more than two directions, the problem of reconstructing a binary image 
from its projections in those directions is NP-complete [5]. For exactly two directions, the 
horizontal and vertical ones, say, it is possible to reconstruct an image in polynomial time. 
Already in 1957, Ryser described an algorithm to do so [10]. He also characterised the set
of projections that correspond to a unique binary image. 

If there are multiple images corresponding to one set of line sums, it is interesting to 
reconstruct an image with a special property. In order to find reconstructions that look 
rather like a real object, two special properties in particular are often imposed on the 
reconstructions. The first is connectivity of the points with value one in the picture [2, 5, 11].
The second is hv-convexity: if in each row and each column, the points with value one form
one connected block, the image is called hv-convex. The reconstruction of hv-convex images,
either connected or not necessarily connected, has been studied extensively [1, 2, 3, 4, 11].

Another relevant concept in this context is the boundary of a binary image. The boundary 
can be defined as the pairs consisting of two adjacent points, one with value 0 and one
with value 1. Here we use 4-adjacency: that is, a point is adjacent to its two vertical and
to its two horizontal neighbours [9]. The number of such pairs of adjacent points with two
different values is called the length of the boundary or sometimes the perimeter length [S]. 

In this paper we will consider given line sums that may correspond to more than one binary 
image. Since the boundary of real objects is often small compared to the area, it makes 
sense to look for reconstructions of which the length of the boundary is as small as possible. 
In particular, if there exists an hv-convex reconstruction, then the length of the boundary 
of that image is the smallest possible. In that sense, the length of the boundary is a more 
general concept than hv-convexity. 

The question we are interested in in this paper is: given line sums, what is the smallest 
length of the boundary that a reconstruction fitting those line sums can have? We can give 
two straightforward lower bounds on the length of the boundary, given the row and column 
sums. Both are equivalent to bounds given by Dahl and Flatberg in [4, Section 2].

The first is that every column with a non-zero sum contributes 2 to the length of the 
horizontal boundary, while every row with non-zero sum contributes 2 to the length of the 
vertical boundary. So if there are m non-zero row sums and n non-zero column sums, then 
the total length of the boundary is at least 2n + 2m. 

For the second bound we use that if the row sums of two consecutive rows are different, the 
length of the horizontal boundary between those rows is at least the absolute difference 
between those row sums. A similar result holds for the column sums and the vertical 
boundary. So if an image has row sums $r_1, r_2, \ldots, r_m$ and column sums $c_1, c_2, \ldots, c_n$, then
the length of the boundary is at least

\[ r_1 + \sum_{i=1}^{m-1} |r_i - r_{i+1}| + r_m + c_1 + \sum_{j=1}^{n-1} |c_j - c_{j+1}| + c_n. \]

Despite being simple, these bounds are sharp in many cases. For example, the first bound 
is sharp if and only if there exists an hv-convex image that satisfies the line sums. On the
other hand it is clear that much information is disregarded in these bounds. The first bound 
does not use the actual value of the non-zero line sums at all, while the second bound only 
uses the column sums to estimate the length of the vertical boundary and only the row 
sums to estimate the length of the horizontal boundary.

In this paper we prove a new lower bound on the length of the boundary that combines
the row and column sums. After introducing some notation in Section 2, we prove this
bound in Section 3. Some examples and a corollary are in Section 4. Finally, in Section 5
we derive an extension of the bound that gives better results in certain cases.



2 Definitions and notation 

Let $F$ be a finite subset of $\mathbb{Z}^2$ with characteristic function $\chi$. (That is, $\chi(x, y) = 1$ if $(x, y) \in F$
and $\chi(x, y) = 0$ otherwise.) For $i \in \mathbb{Z}$, we define row $i$ as the set $\{(x, y) \in \mathbb{Z}^2 : x = i\}$. We
call $i$ the index of the row. For $j \in \mathbb{Z}$, we define column $j$ as the set $\{(x, y) \in \mathbb{Z}^2 : y = j\}$.
We call $j$ the index of the column. Following matrix notation, we use row numbers that
increase when going downwards and column numbers that increase when going to the right.

The row sum $r_i$ is the number of elements of $F$ in row $i$, that is $r_i = \sum_{j \in \mathbb{Z}} \chi(i, j)$. The
column sum $c_j$ of $F$ is the number of elements of $F$ in column $j$, that is $c_j = \sum_{i \in \mathbb{Z}} \chi(i, j)$.
We refer to both row and column sums as the line sums of $F$. We will usually only consider
finite sequences $\mathcal{R} = (r_1, r_2, \ldots, r_m)$ and $\mathcal{C} = (c_1, c_2, \ldots, c_n)$ of row and column sums that
contain all the nonzero line sums.

Given sequences of integers $\mathcal{R} = (r_1, r_2, \ldots, r_m)$ and $\mathcal{C} = (c_1, c_2, \ldots, c_n)$, we say that $(\mathcal{R}, \mathcal{C})$
is consistent if there exists a set $F$ with row sums $\mathcal{R}$ and column sums $\mathcal{C}$. Define $b_i = \#\{j : c_j \geq i\}$
for $i = 1, 2, \ldots, m$. Ryser's theorem [10] states that if $r_1 \geq r_2 \geq \ldots \geq r_m$, the line
sums $(\mathcal{R}, \mathcal{C})$ are consistent if and only if for each $k = 1, 2, \ldots, m$ we have $\sum_{i=1}^k b_i \geq \sum_{i=1}^k r_i$.
From this we can conclude a similar result for the case of not necessarily non-increasing
row sums: if the line sums $(\mathcal{R}, \mathcal{C})$ are consistent, then for all $k = 1, 2, \ldots, m$ we have

\[ \sum_{i=1}^k b_i \geq \sum_{i=1}^k r_i. \tag{1} \]

The converse clearly does not hold.
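As a small illustration of condition (1), the following Python sketch computes the numbers $b_i$ and checks the partial-sum inequalities. The function names are hypothetical and chosen only for this example.

```python
from itertools import accumulate


def b_values(cols, m):
    """b_i = number of columns j with c_j >= i, for i = 1, ..., m."""
    return [sum(1 for c in cols if c >= i) for i in range(1, m + 1)]


def satisfies_condition_1(rows, cols):
    """Check the necessary condition (1): partial sums of b dominate those of r."""
    b = b_values(cols, len(rows))
    return all(sb >= sr for sb, sr in zip(accumulate(b), accumulate(rows)))


# Consistent line sums (rows (1, 1) and (1, 0)) satisfy (1).
print(satisfies_condition_1([2, 1], [2, 1]))  # True

# Row sums (0, 2) with column sums (2, 0) satisfy (1) as well, although no
# image has these line sums: row 2 would need a one in column 2.
print(satisfies_condition_1([0, 2], [2, 0]))  # True
```

The second call shows by example why (1) is only a necessary condition.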

We can view the set $F$ as a picture consisting of cells with zeroes and ones. Rather than
$(i, j) \in F$, we might say that $(i, j)$ has value 1 or that there is a one at $(i, j)$. Similarly, for
$(i, j) \notin F$ we sometimes say that $(i, j)$ has value zero or that there is a zero at $(i, j)$.

We define the boundary of $F$ as the set consisting of all pairs of points $((i, j), (i', j'))$ such
that

• $i = i'$ and $|j - j'| = 1$, or $|i - i'| = 1$ and $j = j'$, and

• $(i, j) \in F$ and $(i', j') \notin F$.

One element of this set we call one piece of the boundary. We can partition the boundary
into two subsets, one containing the pairs of points with $i = i'$ and the other containing the
pairs of points with $j = j'$. The former set we call the vertical boundary and the latter set
we call the horizontal boundary. We define the length of the (horizontal, vertical) boundary
as the number of elements in the (horizontal, vertical) boundary.
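To make the definition concrete, here is a minimal Python sketch that counts the pieces of horizontal and vertical boundary of an image stored as a 0/1 matrix. The function name and the matrix representation are assumptions of this sketch; cells outside the matrix are treated as having value 0.

```python
def boundary_lengths(image):
    """Return (horizontal, vertical) boundary lengths of a 0/1 matrix.

    The horizontal boundary counts pairs of cells in the same column and in
    consecutive rows with different values; the vertical boundary counts
    pairs in the same row and in consecutive columns.
    """
    m = len(image)
    n = len(image[0]) if m else 0

    def value(i, j):
        # Cells outside the matrix have value 0.
        return image[i][j] if 0 <= i < m and 0 <= j < n else 0

    horizontal = sum(
        value(i, j) != value(i + 1, j)
        for i in range(-1, m)
        for j in range(n)
    )
    vertical = sum(
        value(i, j) != value(i, j + 1)
        for i in range(m)
        for j in range(-1, n)
    )
    return horizontal, vertical


print(boundary_lengths([[1, 1], [1, 0]]))  # (4, 4)
```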



3 The main theorem 

Theorem 1. Let row sums $\mathcal{R} = (r_1, r_2, \ldots, r_m)$ and column sums $\mathcal{C} = (c_1, c_2, \ldots, c_n)$
be given, where $r_1 = n$, $r_m = 0$. Let $L_h$ be the total length of the horizontal boundary
of an image with line sums $(\mathcal{R}, \mathcal{C})$. Define $b_i = \#\{j : c_j \geq i\}$ and $d_i = b_i - r_i$ for
$i = 1, 2, \ldots, m$. For any integer $t \geq 0$ and any subset $\{i_1, i_2, \ldots, i_{2t+1}\} \subset \{1, 2, \ldots, m\}$ with
$i_1 < i_2 < \cdots < i_{2t+1}$ we have

\[ L_h \geq 2n + d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2d_{i_{2t+1}}, \tag{2} \]

\[ L_h \geq 2n - d_{i_{2t+1}} + d_{i_{2t}} - d_{i_{2t-1}} + \cdots + d_{i_2} - 2d_{i_1}. \tag{3} \]

Proof. First we prove (2) by induction on $n$. In the initial case $n = 0$ we have $d_i = b_i = r_i = 0$
for all $i$, hence we have to prove that $L_h \geq 0$, which is obviously true.

Now let $n \geq 1$ and consider a binary image $F$ with line sums $(\mathcal{R}, \mathcal{C})$. Let $I \subset \{1, 2, \ldots, m\}$
be the set of indices $i$ such that cell $(i, n)$ has value 1. Note that $\#I = c_n$. Let $F'$ be the
binary image we obtain by deleting column $n$ from $F$. Let $(r'_1, r'_2, \ldots, r'_m)$ be the row sums
of $F'$. The column sums of $F'$ are $(c_1, c_2, \ldots, c_{n-1})$, and we define $b'_i = \#\{j \leq n - 1 : c_j \geq i\}$
and $d'_i = b'_i - r'_i$ for $i = 1, 2, \ldots, m$. We have

\[ r'_i = \begin{cases} r_i & \text{if } i \notin I, \\ r_i - 1 & \text{if } i \in I, \end{cases} \qquad b'_i = \begin{cases} b_i - 1 & \text{if } i \leq c_n, \\ b_i & \text{if } i > c_n, \end{cases} \]

and therefore

\[ d'_i = \begin{cases} d_i - 1 & \text{if } i \notin I \text{ and } i \leq c_n, \\ d_i & \text{if } i \notin I \text{ and } i > c_n, \text{ or } i \in I \text{ and } i \leq c_n, \\ d_i + 1 & \text{if } i \in I \text{ and } i > c_n. \end{cases} \]

As induction hypothesis we assume that (2) is true for the smaller image $F'$. So for the
total length $L'_h$ of the horizontal boundary of $F'$ we have

\[ L'_h \geq 2(n - 1) + d'_{i_1} - d'_{i_2} + d'_{i_3} - \cdots - d'_{i_{2t}} + 2d'_{i_{2t+1}}. \]



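Let $2B$ be equal to the length of the horizontal boundary in column $n$ of $F$. Then $L_h = L'_h + 2B$. We
want to prove (2), hence it suffices to prove

\[ 2B - 2 \geq (d_{i_1} - d'_{i_1}) - (d_{i_2} - d'_{i_2}) + (d_{i_3} - d'_{i_3}) - \cdots - (d_{i_{2t}} - d'_{i_{2t}}) + 2(d_{i_{2t+1}} - d'_{i_{2t+1}}). \tag{4} \]

Write the right-hand side as

\[ \sum_{s=1}^{t} \left( (d_{i_{2s-1}} - d'_{i_{2s-1}}) - (d_{i_{2s}} - d'_{i_{2s}}) \right) + 2(d_{i_{2t+1}} - d'_{i_{2t+1}}). \]

Note that

\[ d_i - d'_i = \begin{cases} 1 & \text{if } i \notin I \text{ and } i \leq c_n, \\ 0 & \text{if } i \notin I \text{ and } i > c_n, \text{ or } i \in I \text{ and } i \leq c_n, \\ -1 & \text{if } i \in I \text{ and } i > c_n. \end{cases} \]

The only possible values of $(d_{i_{2s-1}} - d'_{i_{2s-1}}) - (d_{i_{2s}} - d'_{i_{2s}})$ are therefore $-1$, $0$, $1$ and $2$. If
we have $i_{2s-1}, i_{2s} \leq c_n$ or $i_{2s-1}, i_{2s} > c_n$, then the value 2 is not possible and

\[ (d_{i_{2s-1}} - d'_{i_{2s-1}}) - (d_{i_{2s}} - d'_{i_{2s}}) = 1 \iff i_{2s-1} \notin I \text{ and } i_{2s} \in I. \]

Furthermore note that of the $2B$ pieces of horizontal boundary in column $n$, one is above
row 1 (as $r_1 = n$, so $1 \in I$) and exactly $B - 1$ are between a pair of cells with row indices
$i$ and $i + 1$, such that $i \notin I$ and $i + 1 \in I$. We now distinguish between four cases.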

Case 1. Suppose $i_{2t+1} \leq c_n$ and $i_{2t+1} \notin I$. Then $2(d_{i_{2t+1}} - d'_{i_{2t+1}}) = 2$. In the first $c_n$ cells of
column $n$, there is at least one cell (the one with row index $i_{2t+1}$) that has value 0, hence
$B \geq 2$ and there is a cell with row index greater than $i_{2t+1}$ with value 1. This means that
there are at most $B - 2$ pairs $(i_{2s-1}, i_{2s})$ such that $i_{2s-1} \notin I$ and $i_{2s} \in I$. Also, $i_{2s-1}, i_{2s} \leq c_n$
for all $s$. So

\[ \sum_{s=1}^{t} \left( (d_{i_{2s-1}} - d'_{i_{2s-1}}) - (d_{i_{2s}} - d'_{i_{2s}}) \right) + 2(d_{i_{2t+1}} - d'_{i_{2t+1}}) \leq (B - 2) + 2 = B \leq 2B - 2. \]

Case 2. Suppose $i_{2t+1} \leq c_n$ and $i_{2t+1} \in I$. Then $2(d_{i_{2t+1}} - d'_{i_{2t+1}}) = 0$. Now there are at
most $B - 1$ pairs $(i_{2s-1}, i_{2s})$ such that $i_{2s-1} \notin I$ and $i_{2s} \in I$. Also, $i_{2s-1}, i_{2s} \leq c_n$ for all $s$.
So

\[ \sum_{s=1}^{t} \left( (d_{i_{2s-1}} - d'_{i_{2s-1}}) - (d_{i_{2s}} - d'_{i_{2s}}) \right) + 2(d_{i_{2t+1}} - d'_{i_{2t+1}}) \leq B - 1 \leq 2B - 2. \]

Case 3. Suppose $i_{2t+1} > c_n$ and $B \geq 2$. Then $2(d_{i_{2t+1}} - d'_{i_{2t+1}}) \leq 0$. Again there are at most
$B - 1$ pairs $(i_{2s-1}, i_{2s})$ such that $i_{2s-1} \notin I$ and $i_{2s} \in I$. If there does not exist a $u$ such
that $i_{2u-1} \leq c_n$ and $i_{2u} > c_n$, then we are done, as in the previous case. If there does exist
such a $u$, then

\[ (d_{i_{2u-1}} - d'_{i_{2u-1}}) - (d_{i_{2u}} - d'_{i_{2u}}) = 2 \iff i_{2u-1} \notin I \text{ and } i_{2u} \in I. \]

If $(d_{i_{2u-1}} - d'_{i_{2u-1}}) - (d_{i_{2u}} - d'_{i_{2u}}) = 2$, then on the right-hand side of (4) we have a 2 and
at most $B - 2$ times a 1. If not, then we have no 2 and at most $B$ times a 1. In both cases
we find

\[ \sum_{s=1}^{t} \left( (d_{i_{2s-1}} - d'_{i_{2s-1}}) - (d_{i_{2s}} - d'_{i_{2s}}) \right) + 2(d_{i_{2t+1}} - d'_{i_{2t+1}}) \leq B \leq 2B - 2. \]

Case 4. Suppose $B = 1$. Then $i \in I \iff i \leq c_n$, hence

\[ d'_i = d_i \text{ for all } i. \]

Therefore

\[ \sum_{s=1}^{t} \left( (d_{i_{2s-1}} - d'_{i_{2s-1}}) - (d_{i_{2s}} - d'_{i_{2s}}) \right) + 2(d_{i_{2t+1}} - d'_{i_{2t+1}}) = 0 = 2B - 2. \]

In all possible cases we have now proved inequality (4), which finishes the proof of (2).

Now we prove (3). Let $F$ be a binary $m \times n$ image with row sums $\mathcal{R}$ and column sums $\mathcal{C}$.
Define $\bar{F}$ as the binary $m \times n$ image that has zeroes where $F$ has ones and ones where $F$
has zeroes. Let $(\bar{r}_1, \ldots, \bar{r}_m)$ be the row sums of $\bar{F}$ and $(\bar{c}_1, \ldots, \bar{c}_n)$ the column sums. Define
$\bar{b}_i = \#\{j : \bar{c}_j \geq i\}$ and $\bar{d}_i = \bar{b}_i - \bar{r}_{m+1-i}$ for $i = 1, 2, \ldots, m$. As $\bar{r}_i = n - r_i$ and $\bar{c}_j = m - c_j$
for all $i$ and $j$, we have

\[ \bar{b}_i = \#\{j : m - c_j \geq i\} = \#\{j : c_j \leq m - i\} = n - \#\{j : c_j \geq m + 1 - i\} = n - b_{m+1-i}. \]

Hence

\[ \bar{d}_i = \bar{b}_i - \bar{r}_{m+1-i} = n - b_{m+1-i} - n + r_{m+1-i} = -d_{m+1-i}. \]

As $\bar{r}_1 = 0$ and $\bar{r}_m = n$, we may apply (2) to the row sums $(\bar{r}_m, \bar{r}_{m-1}, \ldots, \bar{r}_1)$. We write
the subset of the row indices we use as $(m + 1 - i_{2t+1}, m + 1 - i_{2t}, \ldots, m + 1 - i_1)$ with
$i_1 < i_2 < \cdots < i_{2t+1}$. We find that for the total length $\bar{L}_h$ of the horizontal boundary of $\bar{F}$
the following holds:

\[ \bar{L}_h \geq 2n + \bar{d}_{m+1-i_{2t+1}} - \bar{d}_{m+1-i_{2t}} + \bar{d}_{m+1-i_{2t-1}} - \cdots - \bar{d}_{m+1-i_2} + 2\bar{d}_{m+1-i_1} \]

\[ = 2n - d_{i_{2t+1}} + d_{i_{2t}} - d_{i_{2t-1}} + \cdots + d_{i_2} - 2d_{i_1}. \]

In each column of $\bar{F}$, the number of horizontal pieces of boundary is equal to the number
of pairs of neighbouring cells such that one cell has value 1 and the other has value 0, plus
one for the boundary below row $m$. In each column of $F$, the number of horizontal pieces of
boundary is equal to the number of pairs of neighbouring cells such that one cell has value
1 and the other has value 0, plus one for the boundary above row 1. As in each column
the number of pairs of neighbouring cells such that one cell has value 1 and the other has
value 0 is the same in $F$ and in $\bar{F}$, we have $L_h = \bar{L}_h$. Hence

\[ L_h \geq 2n - d_{i_{2t+1}} + d_{i_{2t}} - d_{i_{2t-1}} + \cdots + d_{i_2} - 2d_{i_1}. \]

□



4 Some examples and a corollary 



To illustrate Theorem 1, we apply it to two small examples.

Example 1. Let $m = n = 10$ and let row sums (10, 7, 7, 5, 4, 3, 5, 6, 1, 0) and column sums
(8, 8, 8, 8, 6, 3, 2, 2, 2, 1) be given. We compute $b_i$ and $d_i$, $i = 1, 2, \ldots, 10$ as shown below.

  i      1    2    3    4    5    6    7    8    9   10
  b_i   10    9    6    5    5    5    4    4    0    0
  r_i   10    7    7    5    4    3    5    6    1    0
  d_i    0   +2   -1    0   +1   +2   -1   -2   -1    0

We take $t = 1$, $i_1 = 2$, $i_2 = 3$ and $i_3 = 6$. Now (2) tells us that

\[ L_h \geq 20 + 2 - (-1) + 2 \cdot 2 = 27. \]

Alternatively, we take $t = 2$, $i_1 = 2$, $i_2 = 3$, $i_3 = 6$, $i_4 = 8$ and $i_5 = 10$. Now (2) tells us
that

\[ L_h \geq 20 + 2 - (-1) + 2 - (-2) + 2 \cdot 0 = 27. \]

As $L_h$ must be even, we conclude $L_h \geq 28$. This bound is sharp: in Figure 1(a) a binary
image $F$ with the given row and column sums is shown, for which $L_h = 28$.
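The computation in Example 1 can be reproduced mechanically. The sketch below is an illustration only: the helper names are hypothetical, and the search over all index sets of odd size is plain brute force, which is adequate for $m = 10$ but is not claimed to be how the bound should be evaluated in general.

```python
from itertools import combinations


def d_values(rows, cols):
    """d_i = b_i - r_i with b_i = #{j : c_j >= i}, for i = 1, ..., m."""
    return [sum(1 for c in cols if c >= i) - r for i, r in enumerate(rows, start=1)]


def bound_2(d, indices):
    """Value d_{i1} - d_{i2} + ... - d_{i2t} + 2 d_{i2t+1} for increasing 1-based indices."""
    *body, last = indices
    signs = [1, -1] * (len(body) // 2)
    return sum(s * d[i - 1] for s, i in zip(signs, body)) + 2 * d[last - 1]


def best_bound_2(rows, cols):
    """Largest right-hand side of (2) over all index sets of odd size."""
    d = d_values(rows, cols)
    m, n = len(rows), len(cols)
    best = max(
        bound_2(d, idx)
        for size in range(1, m + 1, 2)
        for idx in combinations(range(1, m + 1), size)
    )
    return 2 * n + best


rows = [10, 7, 7, 5, 4, 3, 5, 6, 1, 0]
cols = [8, 8, 8, 8, 6, 3, 2, 2, 2, 1]
print(d_values(rows, cols))    # [0, 2, -1, 0, 1, 2, -1, -2, -1, 0]
print(best_bound_2(rows, cols))  # 27
```

No index set does better than the two choices made in Example 1, so the parity argument is what lifts the bound from 27 to 28.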

Example 2. Let $m = n = 10$ and let row sums (10, 9, 7, 6, 8, 4, 5, 2, 3, 0) and column sums
(9, 8, 8, 6, 6, 4, 4, 4, 3, 2) be given. We compute $b_i$ and $d_i$, $i = 1, 2, \ldots, 10$ as shown below.

  i      1    2    3    4    5    6    7    8    9   10
  b_i   10   10    9    8    5    5    3    3    1    0
  r_i   10    9    7    6    8    4    5    2    3    0
  d_i    0   +1   +2   +2   -3   +1   -2   +1   -2    0

We take $t = 2$, $i_1 = 5$, $i_2 = 6$, $i_3 = 7$, $i_4 = 8$ and $i_5 = 9$. Now (3) tells us that

\[ L_h \geq 20 - (-2) + 1 - (-2) + 1 - 2 \cdot (-3) = 32. \]

This bound is sharp: in Figure 1(b) a binary image $F$ with the given row and column sums
is shown, for which $L_h = 32$.

Figure 1: The binary images from Examples 1 and 2. The grey cells have value 1, the other cells
value 0. The numbers indicate the row and column sums. (a) The length of the horizontal
boundary of this image is 28. (b) The length of the horizontal boundary of this image is 32.
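The proof of (3) suggests a convenient way to evaluate it: reverse the sequence $(d_i)$, change all signs, and maximise the expression from (2). The following Python sketch does this with a simple linear scan instead of enumerating index sets. The scan and all names are assumptions of this sketch, not part of the proof.

```python
def best_alternating(d):
    """Maximum of d[i1] - d[i2] + ... - d[i2t] + 2*d[i2t+1] over increasing index sets."""
    even = 0              # best value after an even number of chosen indices
    odd = float("-inf")   # best value after an odd number of chosen indices
    best = float("-inf")
    for x in d:
        # Use x as the final index, which carries coefficient +2.
        best = max(best, even + 2 * x)
        # Or use x as an intermediate index with coefficient -1 or +1.
        even, odd = max(even, odd - x), max(odd, even + x)
    return best


def best_bound_3(rows, cols):
    """Largest right-hand side of (3), via the substitution d-bar_i = -d_{m+1-i}."""
    n = len(cols)
    d = [sum(1 for c in cols if c >= i) - r for i, r in enumerate(rows, start=1)]
    d_bar = [-x for x in reversed(d)]
    return 2 * n + best_alternating(d_bar)


rows = [10, 9, 7, 6, 8, 4, 5, 2, 3, 0]
cols = [9, 8, 8, 6, 6, 4, 4, 4, 3, 2]
print(best_bound_3(rows, cols))  # 32
```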



In the Introduction we mentioned two simple bounds on the length of the boundary. We
recall them here, just for the horizontal boundary. The first one uses that in every column,
there are at least two pieces of boundary, so if there are $n$ columns with nonzero sums,
then

\[ L_h \geq 2n. \tag{5} \]

The other bound computes the sum of the absolute differences between consecutive row
sums, which yields

\[ L_h \geq r_1 + \sum_{i=1}^{m-1} |r_i - r_{i+1}| + r_m. \tag{6} \]

In order to compare the bounds in Theorem 1 to these two simple bounds, we construct
two families of examples.

Example 3. Let the number of columns $n$ be even. Let $m = n + 2$. Define line sums

\[ \mathcal{C} = (n, n, n - 2, n - 2, \ldots, 4, 4, 2, 2), \qquad \mathcal{R} = (n, n - 1, n - 1, n - 3, n - 3, \ldots, 3, 3, 1, 1, 0). \]

We calculate

\[ (b_1, b_2, \ldots, b_m) = (n, n, n - 2, n - 2, \ldots, 2, 2, 0, 0), \]

\[ (d_1, d_2, \ldots, d_m) = (0, +1, -1, +1, -1, \ldots, +1, -1, +1, -1, 0). \]

Now (2) tells us that

\[ L_h \geq 2n + \frac{n}{2} \cdot (1 - (-1)) + 2 \cdot 0 = 3n. \]

On the other hand, (5) says $L_h \geq 2n$, while (6) gives

\[ L_h \geq n + 1 + \frac{n - 2}{2} \cdot 2 + 1 = 2n. \]

So Theorem 1 gives a much better bound in this family of examples. In fact, it is sharp:
there exists a binary image with the length of the horizontal boundary equal to $3n$. Such an image is
easy to construct; see for an example Figure 2(a).

Example 4. Let $m = n + 2$. Define line sums

\[ \mathcal{C} = (2, 2, 2, \ldots, 2, 2, 2), \qquad \mathcal{R} = (n, 1, 1, \ldots, 1, 1, 0). \]

We calculate

\[ (b_1, b_2, \ldots, b_m) = (n, n, 0, 0, 0, \ldots, 0, 0, 0), \]

\[ (d_1, d_2, \ldots, d_m) = (0, +(n - 1), -1, -1, -1, \ldots, -1, -1, -1, 0). \]

Now (2) tells us that

\[ L_h \geq 2n + 2 \cdot (n - 1) = 4n - 2. \]

On the other hand, (5) says $L_h \geq 2n$, while (6) gives

\[ L_h \geq n + (n - 1) + 1 = 2n. \]

So again Theorem 1 gives a much better bound. In fact, it is sharp: there exists a binary
image with the length of the horizontal boundary equal to $4n - 2$. Such an image is easy to construct;
see for an example Figure 2(b).
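The closed forms in Examples 3 and 4 can be checked for concrete values of $n$. The Python sketch below builds the two families of line sums and evaluates (2) for the index sets used in the examples, together with the simple bound (6). All function names are hypothetical and serve only this illustration.

```python
def d_values(rows, cols):
    """d_i = #{j : c_j >= i} - r_i for i = 1, ..., m."""
    return [sum(1 for c in cols if c >= i) - r for i, r in enumerate(rows, start=1)]


def rhs_2(n, d, indices):
    """Right-hand side of (2) for increasing 1-based indices of odd length."""
    *body, last = indices
    alternating = sum(
        d[i - 1] if pos % 2 == 0 else -d[i - 1] for pos, i in enumerate(body)
    )
    return 2 * n + alternating + 2 * d[last - 1]


def bound_6(rows):
    """Right-hand side of (6): r_1 + sum |r_i - r_{i+1}| + r_m."""
    padded = [0] + list(rows) + [0]
    return sum(abs(a - b) for a, b in zip(padded, padded[1:]))


def example_3(n):
    """Line sums of Example 3 (n even): returns (rows, cols)."""
    cols = [v for v in range(n, 0, -2) for _ in range(2)]
    rows = [n] + [v for v in range(n - 1, 0, -2) for _ in range(2)] + [0]
    return rows, cols


def example_4(n):
    """Line sums of Example 4: returns (rows, cols)."""
    return [n] + [1] * n + [0], [2] * n


n = 8
rows, cols = example_3(n)
# Index set {2, 3, ..., n + 2}, as in Example 3.
print(rhs_2(n, d_values(rows, cols), list(range(2, n + 3))), bound_6(rows))  # 24 16

rows, cols = example_4(n)
# Index set {2}, as in Example 4.
print(rhs_2(n, d_values(rows, cols), [2]), bound_6(rows))  # 30 16
```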



We can easily generalise the result from Theorem 1 to the case where the conditions $r_1 = n$
and $r_m = 0$ are not satisfied.

Corollary 1. Let row sums $\mathcal{R} = (r_1, r_2, \ldots, r_m)$ and column sums $\mathcal{C} = (c_1, c_2, \ldots, c_n)$ be
given. Let $L_h$ be the total length of the horizontal boundary of an image with line sums
$(\mathcal{R}, \mathcal{C})$. Define $b_i = \#\{j : c_j \geq i\}$ and $d_i = b_i - r_i$ for $i = 1, 2, \ldots, m$. Also set $d_0 = d_{m+1} = 0$.
For any integer $t \geq 0$ and any subset $\{i_1, i_2, \ldots, i_{2t+1}\} \subset \{0, 1, 2, \ldots, m, m + 1\}$ with
$i_1 < i_2 < \cdots < i_{2t+1}$ we have

\[ L_h \geq 2r_1 + d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2d_{i_{2t+1}}, \tag{7} \]

\[ L_h \geq 2r_1 - d_{i_{2t+1}} + d_{i_{2t}} - d_{i_{2t-1}} + \cdots + d_{i_2} - 2d_{i_1}. \tag{8} \]

Figure 2: Binary images from Examples 3 and 4 with $n = 8$. The grey cells have value 1, the
other cells value 0. The numbers indicate the row and column sums. (a) The length of the
horizontal boundary of this image is $24 = 3n$. (b) The length of the horizontal boundary of
this image is $30 = 4n - 2$.



Proof. Let $F$ be a binary image with line sums $(\mathcal{R}, \mathcal{C})$ and a horizontal boundary of total
length $L_h$. Construct $F'$ by adding a row above row 1 with row sum $n$ and a row below
row $m$ with row sum 0. Let $L'_h$ be the length of the horizontal boundary of $F'$. We have
$L'_h = L_h + 2(n - r_1)$. The column sums of $F'$ are $c'_j = c_j + 1$, $j = 1, 2, \ldots, n$. The row sums
are $r'_1 = n$, $r'_i = r_{i-1}$ for $i = 2, 3, \ldots, m + 1$ and $r'_{m+2} = 0$. Let $b'_i = \#\{j : c'_j \geq i\}$ and
$d'_i = b'_i - r'_i$ for $i = 1, 2, \ldots, m + 2$. Then for all $i = 2, 3, \ldots, m + 1$ we have

\[ b'_i = \#\{j : c_j + 1 \geq i\} = \#\{j : c_j \geq i - 1\} = b_{i-1}, \]

so $d'_i = b_{i-1} - r_{i-1} = d_{i-1}$. Also, $d'_1 = d_0 = 0$ and $d'_{m+2} = d_{m+1} = 0$. We apply Theorem 1
to $F'$ with the set of indices $\{i_1 + 1, i_2 + 1, \ldots, i_{2t+1} + 1\}$ and we find

\[ L'_h \geq 2n + d'_{i_1+1} - d'_{i_2+1} + d'_{i_3+1} - \cdots - d'_{i_{2t}+1} + 2d'_{i_{2t+1}+1} = 2n + d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2d_{i_{2t+1}}, \]

\[ L'_h \geq 2n - d_{i_{2t+1}} + d_{i_{2t}} - d_{i_{2t-1}} + \cdots + d_{i_2} - 2d_{i_1}, \]

and therefore

\[ L_h \geq 2r_1 + d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2d_{i_{2t+1}}, \]

\[ L_h \geq 2r_1 - d_{i_{2t+1}} + d_{i_{2t}} - d_{i_{2t-1}} + \cdots + d_{i_2} - 2d_{i_1}. \]

□
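The key step of this proof, $L'_h = L_h + 2(n - r_1)$, is easy to check on a concrete image. The following Python sketch does so for one small image chosen arbitrarily for illustration. The function names and the matrix representation are assumptions of this sketch.

```python
def horizontal_boundary(image):
    """Length of the horizontal boundary of a 0/1 matrix (zeroes outside)."""
    m, n = len(image), len(image[0])
    padded = [[0] * n] + [list(row) for row in image] + [[0] * n]
    return sum(
        padded[i][j] != padded[i + 1][j]
        for i in range(m + 1)
        for j in range(n)
    )


def extend(image):
    """Add a row of ones above the image and a row of zeroes below it."""
    n = len(image[0])
    return [[1] * n] + [list(row) for row in image] + [[0] * n]


image = [
    [0, 1, 1],
    [1, 1, 0],
    [0, 1, 0],
]
n, r1 = len(image[0]), sum(image[0])
print(horizontal_boundary(image))          # 6
print(horizontal_boundary(extend(image)))  # 8, which is 6 + 2 * (n - r1)
```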
5 A variation



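Theorem 2. Let row sums $\mathcal{R} = (r_1, r_2, \ldots, r_m)$ and column sums $\mathcal{C} = (c_1, c_2, \ldots, c_n)$ be
given, where $r_1 = n$, $r_m = 0$. Suppose there exists an image $F$ with line sums $(\mathcal{R}, \mathcal{C})$ and let
$L_h(F)$ be the total length of the horizontal boundary of this image. Define $b_i = \#\{j : c_j \geq i\}$
and $d_i = b_i - r_i$ for $i = 1, 2, \ldots, m$. Let $k$ be an integer with $2 \leq k \leq m - 1$ such
that $d_k \leq 0$ and $d_{k+1} \geq 0$. Let $\sigma = \sum_{i=1}^{k} d_i$. For any integers $t, s \geq 0$ and any sets
$\{i_1, i_2, \ldots, i_{2t+1}\} \subset \{1, 2, \ldots, k - 1, k, m\}$ with $i_1 < i_2 < \cdots < i_{2t+1}$ and $\{\tilde{\imath}_1, \tilde{\imath}_2, \ldots, \tilde{\imath}_{2s+1}\} \subset
\{1, k + 1, k + 2, \ldots, m - 1, m\}$ with $\tilde{\imath}_1 < \tilde{\imath}_2 < \cdots < \tilde{\imath}_{2s+1}$ we have

\[ L_h(F) \geq 2n + d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2d_{i_{2t+1}} + d_{\tilde{\imath}_1} - d_{\tilde{\imath}_2} + d_{\tilde{\imath}_3} - \cdots - d_{\tilde{\imath}_{2s}} + 2d_{\tilde{\imath}_{2s+1}} - \sigma. \tag{9} \]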

Proof. We will prove the theorem by induction on $\sigma$. Note that by (1) we have $\sigma \geq 0$, since
the line sums are consistent.

As we are only considering the horizontal boundary, we may for convenience assume that
$c_1 \geq c_2 \geq \ldots \geq c_n$.

Suppose $\sigma = 0$. Then

\[ \sum_{i=1}^{k} r_i = \sum_{i=1}^{k} b_i = \sum_{i=1}^{k} \#\{j : c_j \geq i\} = \sum_{j \mid c_j \leq k} c_j + \sum_{j \mid c_j > k} k. \]

So in any column $j$ with $c_j > k$ we must have $(i, j) \in F$ for $1 \leq i \leq k$, and in any column
$j$ with $c_j \leq k$ we must have $(i, j) \notin F$ for $k + 1 \leq i \leq m$. This means that we can split
the image $F$ into four smaller images, one of which contains only ones and one of which
contains only zeroes. The other two parts we call $F_1$ and $F_2$ (see Figure 3). In order to
have images with the first row filled with ones and the last row filled with zeroes, we glue
row $m$ to $F_1$ and row 1 to $F_2$. More precisely, let $F_1$ consist of rows $1, 2, \ldots, k - 1, k$ and $m$
of $F$ and the columns $j$ with $c_j \leq k$; let $F_2$ consist of rows 1 and $k + 1, k + 2, \ldots, m - 1, m$
of $F$ and the columns $j$ with $c_j > k$.

The columns of $F$ with sum at most $k$ are exactly the columns with indices greater than
$b_{k+1}$. Define $h = b_{k+1}$. Let $r^{(1)}_1, r^{(1)}_2, \ldots, r^{(1)}_k, r^{(1)}_m$ be the row sums of $F_1$, and let $r^{(2)}_1, r^{(2)}_{k+1},
\ldots, r^{(2)}_{m-1}, r^{(2)}_m$ be the row sums of $F_2$. We have

\[ r^{(1)}_i = r_i - h \text{ for } 1 \leq i \leq k, \text{ and } r^{(1)}_m = r_m, \]

\[ r^{(2)}_i = r_i \text{ for } k + 1 \leq i \leq m, \text{ and } r^{(2)}_1 = h = r_1 - (n - h). \]

Figure 3: Splitting the image $F$ into four smaller images.



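Let $c^{(1)}_{h+1}, c^{(1)}_{h+2}, \ldots, c^{(1)}_{n-1}, c^{(1)}_n$ be the column sums of $F_1$, and let $c^{(2)}_1, \ldots, c^{(2)}_{h-1}, c^{(2)}_h$ be
the column sums of $F_2$. We have

\[ c^{(1)}_j = c_j, \text{ and } c^{(2)}_j = c_j - (k - 1) \text{ for all } j. \]

Define

\[ b^{(1)}_1 = \#\{j \geq h + 1 : c^{(1)}_j \geq 1\}, \qquad b^{(2)}_1 = \#\{j \leq h : c^{(2)}_j \geq 1\}, \]

\[ b^{(1)}_2 = \#\{j \geq h + 1 : c^{(1)}_j \geq 2\}, \qquad b^{(2)}_{k+1} = \#\{j \leq h : c^{(2)}_j \geq 2\}, \]

\[ \vdots \]

\[ b^{(1)}_k = \#\{j \geq h + 1 : c^{(1)}_j \geq k\}, \qquad b^{(2)}_{m-1} = \#\{j \leq h : c^{(2)}_j \geq m - k\}, \]

\[ b^{(1)}_m = \#\{j \geq h + 1 : c^{(1)}_j \geq k + 1\}, \qquad b^{(2)}_m = \#\{j \leq h : c^{(2)}_j \geq m - k + 1\}. \]

For $1 \leq i \leq k$ we have

\[ b^{(1)}_i = \#\{j \geq h + 1 : c^{(1)}_j \geq i\} = \#\{j \leq n : c_j \geq i\} - \#\{j \leq h : c_j \geq i\} = b_i - h. \]

Also, $b^{(1)}_m = 0 = b_m$. For $k + 1 \leq i \leq m$ we have

\[ b^{(2)}_i = \#\{j \leq h : c^{(2)}_j \geq i - k + 1\} = \#\{j \leq h : c_j \geq i\} = \#\{j \leq n : c_j \geq i\} - \#\{j \geq h + 1 : c_j \geq i\} = b_i - 0 = b_i. \]

Also, $b^{(2)}_1 = h = b_1 - (n - h)$. Now define $d^{(1)}_i = b^{(1)}_i - r^{(1)}_i$ for $i \in \{1, 2, \ldots, k - 1, k, m\}$ and
$d^{(2)}_i = b^{(2)}_i - r^{(2)}_i$ for $i \in \{1, k + 1, k + 2, \ldots, m - 1, m\}$. We find

\[ d^{(1)}_i = b_i - h - (r_i - h) = d_i \text{ for } 1 \leq i \leq k, \]

\[ d^{(1)}_m = b_m - r_m = d_m, \]

\[ d^{(2)}_i = b_i - r_i = d_i \text{ for } k + 1 \leq i \leq m, \]

\[ d^{(2)}_1 = b_1 - (n - h) - (r_1 - (n - h)) = d_1. \]

All in all we conclude $d^{(1)}_i = d_i$ and $d^{(2)}_i = d_i$ for all $i$.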



The total length of the horizontal boundary of $F$ in the columns $j$ with $c_j \leq k$ is exactly
the same as the total length $L_h(F_1)$ of the horizontal boundary of $F_1$. The total length
of the horizontal boundary of $F$ in the columns $j$ with $c_j > k$ is exactly the same as the
total length $L_h(F_2)$ of the horizontal boundary of $F_2$. So $L_h(F) = L_h(F_1) + L_h(F_2)$. Note
that $F_1$ has $n - b_{k+1}$ columns and $F_2$ has $b_{k+1}$ columns. By Theorem 1 applied to $F_1$ we
know that for any integer $t \geq 0$ and any set $\{i_1, i_2, \ldots, i_{2t+1}\} \subset \{1, 2, \ldots, k - 1, k, m\}$ with
$i_1 < i_2 < \cdots < i_{2t+1}$ we have

\[ L_h(F_1) \geq 2(n - b_{k+1}) + d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2d_{i_{2t+1}}. \]

By the same theorem applied to $F_2$ we know that for any integer $s \geq 0$ and any set
$\{\tilde{\imath}_1, \tilde{\imath}_2, \ldots, \tilde{\imath}_{2s+1}\} \subset \{1, k + 1, k + 2, \ldots, m - 1, m\}$ with $\tilde{\imath}_1 < \tilde{\imath}_2 < \cdots < \tilde{\imath}_{2s+1}$ we have

\[ L_h(F_2) \geq 2b_{k+1} + d_{\tilde{\imath}_1} - d_{\tilde{\imath}_2} + d_{\tilde{\imath}_3} - \cdots - d_{\tilde{\imath}_{2s}} + 2d_{\tilde{\imath}_{2s+1}}. \]

Adding these two results yields (9), as $\sigma = 0$.

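Now let $\sigma \geq 1$ and suppose that we have already proven the theorem for any image with
$\sum_{i=1}^{k} d_i < \sigma$. Let

\[ A_1 = \max\{d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2d_{i_{2t+1}}\}, \]

\[ A_2 = \max\{d_{\tilde{\imath}_1} - d_{\tilde{\imath}_2} + d_{\tilde{\imath}_3} - \cdots - d_{\tilde{\imath}_{2s}} + 2d_{\tilde{\imath}_{2s+1}}\}, \]

where the first maximum is taken over all integers $t \geq 0$ and sets $\{i_1, i_2, \ldots, i_{2t+1}\} \subset
\{1, 2, \ldots, k - 1, k, m\}$ with $i_1 < i_2 < \cdots < i_{2t+1}$, and the second maximum over all integers
$s \geq 0$ and sets $\{\tilde{\imath}_1, \tilde{\imath}_2, \ldots, \tilde{\imath}_{2s+1}\} \subset \{1, k + 1, k + 2, \ldots, m - 1, m\}$ with $\tilde{\imath}_1 < \tilde{\imath}_2 < \cdots < \tilde{\imath}_{2s+1}$.
Furthermore, fix $i_1, i_2, \ldots, i_{2t+1}$ and $\tilde{\imath}_1, \tilde{\imath}_2, \ldots, \tilde{\imath}_{2s+1}$ such that these maxima are attained.

Since $d_k \leq 0$ by definition of $k$, and since $d_m = 0$, we have

\[ d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2d_k \leq d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2d_m. \]

So if $i_{2t+1} = k$, we can replace it by $m$ without decreasing the value of the expression. Hence we may assume

\[ i_{2t+1} \neq k. \tag{10} \]

We also know $d_{k+1} \geq 0$ by definition of $k$, and $d_1 = 0$. So if $s \geq 1$, then

\[ d_1 - d_{k+1} + d_{\tilde{\imath}_3} - \cdots - d_{\tilde{\imath}_{2s}} + 2d_{\tilde{\imath}_{2s+1}} \leq d_{\tilde{\imath}_3} - \cdots - d_{\tilde{\imath}_{2s}} + 2d_{\tilde{\imath}_{2s+1}}. \]

This means that if $s \geq 1$, we may assume without loss of generality that $(\tilde{\imath}_1, \tilde{\imath}_2) \neq (1, k + 1)$.
Also,

\[ d_1 - d_{\tilde{\imath}_2} + d_{\tilde{\imath}_3} - \cdots - d_{\tilde{\imath}_{2s}} + 2d_{\tilde{\imath}_{2s+1}} \leq d_{k+1} - d_{\tilde{\imath}_2} + d_{\tilde{\imath}_3} - \cdots - d_{\tilde{\imath}_{2s}} + 2d_{\tilde{\imath}_{2s+1}}. \]

This means that if $s \geq 1$ and $\tilde{\imath}_2 > k + 1$, we may assume that $\tilde{\imath}_1 \neq 1$. Finally,

\[ 2d_1 \leq 2d_{k+1}, \]

so if $s = 0$ we may also assume that $\tilde{\imath}_1 \neq 1$.

All in all we may assume in all cases that

\[ \tilde{\imath}_1 \neq 1. \tag{11} \]

It suffices to prove

\[ L_h(F) \geq 2n + A_1 + A_2 - \sigma. \tag{12} \]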

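Let $j$ with $1 \leq j \leq n$ be such that $\#(\{(1, j), (2, j), \ldots, (k, j)\} \cap F) < \min(c_j, k)$, i.e. in
column $j$ there is at least one one in rows $k + 1, k + 2, \ldots, m$ and at least one zero in rows
$1, 2, \ldots, k$. Such a column exists, because $\sigma \geq 1$ and hence

\[ \sum_{i=1}^{k} r_i < \sum_{i=1}^{k} b_i = \sum_{i=1}^{k} \#\{j : c_j \geq i\} = \sum_{j \mid c_j \leq k} c_j + \sum_{j \mid c_j > k} k. \]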

We will now consider various cases. 

Case 1. Suppose that there exist integers $l \geq 2$, $h \geq k + 1$ and $u \geq 0$ such that $l + u \leq k$,
$h + u \leq m - 1$ and

• $(l - 1, j) \in F$, and

• $(l, j), (l + 1, j), \ldots, (l + u, j) \notin F$, and

• $(h, j), (h + 1, j), \ldots, (h + u, j) \in F$, and

• $(h + u + 1, j) \notin F$, and

• $(l + u + 1, j) \in F$ or $(h - 1, j) \notin F$.

Figure 4: Two possibilities for column $j$ in Case 1. The grey cells have value 1, the other cells
value 0.

We define a new image $F'$ by moving the ones at $(h, j), (h + 1, j), \ldots, (h + u, j)$ to
$(l, j), (l + 1, j), \ldots, (l + u, j)$; that is,

\[ F' = F \cup \{(l, j), (l + 1, j), \ldots, (l + u, j)\} \setminus \{(h, j), (h + 1, j), \ldots, (h + u, j)\}. \]

The column sums of $F'$ are identical to the column sums of $F$. The row sums of $F'$ are
given by

\[ r'_i = \begin{cases} r_i + 1 & \text{if } l \leq i \leq l + u, \\ r_i - 1 & \text{if } h \leq i \leq h + u, \\ r_i & \text{else.} \end{cases} \]

Define $d'_i = b_i - r'_i$ and $\sigma' = \sum_{i=1}^{k} d'_i = \sigma - (u + 1)$. By the induction hypothesis, we have
for the total length $L_h(F')$ of the horizontal boundary of $F'$

\[ L_h(F') \geq 2n + A'_1 + A'_2 - \sigma', \]

where

\[ A'_1 = d'_{i_1} - d'_{i_2} + d'_{i_3} - \cdots - d'_{i_{2t}} + 2d'_{i_{2t+1}}, \]

\[ A'_2 = d'_{\tilde{\imath}_1} - d'_{\tilde{\imath}_2} + d'_{\tilde{\imath}_3} - \cdots - d'_{\tilde{\imath}_{2s}} + 2d'_{\tilde{\imath}_{2s+1}}. \]

By moving the $u + 1$ ones in column $j$, the piece of horizontal boundary between row $l - 1$
and row $l$ has vanished, just like the piece of horizontal boundary between row $h + u$ and
$h + u + 1$. If $(l + u + 1, j) \in F$, the piece of horizontal boundary between row $l + u$ and row
$l + u + 1$ has also vanished, but there may be a new piece of horizontal boundary between
row $h - 1$ and $h$. On the other hand, if $(h - 1, j) \notin F$, the piece of horizontal boundary
between row $h - 1$ and row $h$ has vanished, but there may be a new piece of horizontal
boundary between row $l + u$ and $l + u + 1$. At least one of the two is the case. All in all, we
have $L_h(F') \leq L_h(F) - 2$.

Figure 5: Moving ones in Case 1, in both possible configurations. The grey cells have value 1,
the other cells value 0.

Furthermore, some of the $d'_i$ involved in $A'_1$ or $A'_2$ may be different from the corresponding
$d_i$. Since $\{i_1, i_2, \ldots, i_{2t+1}\} \subset \{1, 2, \ldots, k - 1, k, m\}$, we have $d'_i = d_i$ or $d'_i = d_i - 1$ for
$i \in \{i_1, i_2, \ldots, i_{2t+1}\}$. The values of $i$ for which $d'_i = d_i - 1$ are all consecutive. Since
the coefficients for $d_i$ in $A_1$ are alternatingly positive and negative, and there is only one
positive coefficient that is $+2$ rather than $+1$, we have

\[ A'_1 = d'_{i_1} - d'_{i_2} + d'_{i_3} - \cdots - d'_{i_{2t}} + 2d'_{i_{2t+1}} \geq d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2d_{i_{2t+1}} - 2 = A_1 - 2. \]

Since $\{\tilde{\imath}_1, \tilde{\imath}_2, \ldots, \tilde{\imath}_{2s+1}\} \subset \{1, k + 1, k + 2, \ldots, m - 1, m\}$, we have $d'_i = d_i$ or $d'_i = d_i + 1$
for $i \in \{\tilde{\imath}_1, \tilde{\imath}_2, \ldots, \tilde{\imath}_{2s+1}\}$. By a similar argument as above and by the fact that all negative
coefficients in $A_2$ are equal to $-1$, we have

\[ A'_2 \geq A_2 - 1. \]

Finally, we have $\sigma' = \sigma - (u + 1) \leq \sigma - 1$. We conclude

\[ L_h(F) \geq L_h(F') + 2 \]

\[ \geq 2n + A'_1 + A'_2 - \sigma' + 2 \]

\[ \geq 2n + (A_1 - 2) + (A_2 - 1) - (\sigma - 1) + 2 \]

\[ = 2n + A_1 + A_2 - \sigma. \]

This proves (12) in Case 1.
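The effect of the move in Case 1 on a single column can be illustrated with a short Python sketch. The two columns below are hypothetical examples of the two configurations allowed in Case 1 (they are not the columns drawn in the figures), and the function names are assumptions of this sketch. Rows are numbered from 1 as in the text.

```python
def column_boundary(column):
    """Number of pieces of horizontal boundary in one column (zeroes outside)."""
    padded = [0] + list(column) + [0]
    return sum(a != b for a, b in zip(padded, padded[1:]))


def move_block(column, l, h, u):
    """Move the ones in rows h, ..., h+u to rows l, ..., l+u (1-based rows)."""
    new = list(column)
    for offset in range(u + 1):
        # The target cells must hold zeroes and the source cells ones.
        assert new[l - 1 + offset] == 0 and new[h - 1 + offset] == 1
        new[l - 1 + offset] = 1
        new[h - 1 + offset] = 0
    return new


# Configuration with (l+u+1, j) in F: l = 2, u = 1, h = 6.
first = [1, 0, 0, 1, 0, 1, 1, 0]
moved = move_block(first, l=2, h=6, u=1)
print(column_boundary(first), column_boundary(moved))  # 6 2

# Configuration with (h-1, j) not in F: l = 2, u = 1, h = 6.
second = [1, 0, 0, 0, 0, 1, 1, 0]
moved = move_block(second, l=2, h=6, u=1)
print(column_boundary(second), column_boundary(moved))  # 4 2
```

In both columns the boundary decreases by at least 2, in line with the inequality $L_h(F') \leq L_h(F) - 2$.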



Case 2. Suppose that the conditions of Case 1 do not hold and furthermore that $(k, j) \in F$
and $(k + 1, j) \in F$. Then there exist integers $l \geq 2$, $h \leq k$ and $u \geq 0$ such that $h \geq l + 1$,
$k + 1 \leq h + u \leq m - 1$ and

• $(l - 1, j) \in F$, and

• $(l, j), (l + 1, j), \ldots, (h - 1, j) \notin F$, and

• $(h, j), (h + 1, j), \ldots, (h + u, j) \in F$, and

• $(h + u + 1, j) \notin F$.

As Case 1 does not apply, we cannot change all zeroes in $(l, j), (l + 1, j), \ldots, (h - 1, j)$
into ones by moving ones from $(k + 1, j), (k + 2, j), \ldots, (h + u, j)$. This implies that
$h - l > (h + u) - k \geq 1$, so $l < h - 1$. We will now distinguish between several cases.

Figure 6: Illustrations for Case 2 of the proof. The grey cells have value 1, the other cells value
0. (a) An example of column $j$ in Case 2. (b) Moving the ones in Case 2a. (c) Moving the ones in
Case 2b.

Figure 7: More illustrations for Case 2 of the proof. The grey cells have value 1, the other cells
value 0. (a) Moving the ones in Case 2c1. (b) Moving the ones in Case 2c2.

Case 2a. Suppose that there does not exist an integer $r$ with $0 \leq r \leq t$ such that $l = i_{2r+1}$.
We define a new image $F'$ by moving the one at $(h + u, j)$ to $(l, j)$; that is,

\[ F' = F \cup \{(l, j)\} \setminus \{(h + u, j)\}. \]

We define $r'_i$, $d'_i$, $\sigma'$, $A'_1$, $A'_2$ and $L_h(F')$ similarly as in Case 1. As in Case 1 we have
$A'_2 \geq A_2 - 1$. However, of the $d_i$ with $i \in \{1, 2, \ldots, k - 1, k, m\}$ only one has changed
(namely $d'_l = d_l - 1$), and we know that $d_l$ does not have a positive coefficient in $A_1$.
So $A'_1 \geq A_1$. Furthermore, $L_h(F') = L_h(F)$ and $\sigma' = \sigma - 1$. By applying the induction
hypothesis to $F'$, we find

\[ L_h(F) = L_h(F') \]

\[ \geq 2n + A'_1 + A'_2 - \sigma' \]

\[ \geq 2n + A_1 + (A_2 - 1) - (\sigma - 1) \]

\[ = 2n + A_1 + A_2 - \sigma. \]

This proves (12) in Case 2a.

Case 2b. Suppose that there does not exist an integer $r$ with $0 \leq r \leq t$ such that $h - 1 =
i_{2r+1}$. We define a new image $F'$ by moving the one at $(h + u, j)$ to $(h - 1, j)$; the rest of
the proof is the same as in Case 2a.

Case 2c. Suppose neither Case 2a nor Case 2b applies. Then there are integers $r_1$ and $r_2$
with $0 \leq r_1 < r_2 \leq t$ such that $l = i_{2r_1+1}$ and $h - 1 = i_{2r_2+1}$. Note that $r_1 < t$, so $d_l$ has
coefficient $+1$ in $A_1$. Now let $v = i_{2r_1+2} < h - 1$. Again, we distinguish between two cases.

Case 2c1. Suppose that $k + 1 \leq h + u - v + l$. Then we define a new image $F'$ by moving
the ones at $(h + u - v + l, j), (h + u - v + l + 1, j), \ldots, (h + u, j)$ to $(l, j), (l + 1, j), \ldots,
(v, j)$; that is,

\[ F' = F \cup \{(l, j), (l + 1, j), \ldots, (v, j)\} \setminus \{(h + u - v + l, j), (h + u - v + l + 1, j), \ldots, (h + u, j)\}. \]

We define $r'_i$, $d'_i$, $\sigma'$, $A'_1$, $A'_2$ and $L_h(F')$ similarly as in Case 1. As in Case 2a we have
$A'_2 \geq A_2 - 1$ and $L_h(F') = L_h(F)$. Also, $\sigma' \leq \sigma - 1$. Furthermore, of the $d_i$ with $i \in
\{i_1, i_2, \ldots, i_{2t+1}\}$ exactly two have changed: $d'_l = d_l - 1$ and $d'_v = d_v - 1$. As $d_l$ has
coefficient $+1$ in $A_1$ and $d_v$ has coefficient $-1$ in $A_1$, we have $A'_1 = A_1$. By applying the
induction hypothesis to $F'$, we find

\[ L_h(F) = L_h(F') \]

\[ \geq 2n + A'_1 + A'_2 - \sigma' \]

\[ \geq 2n + A_1 + (A_2 - 1) - (\sigma - 1) \]

\[ = 2n + A_1 + A_2 - \sigma. \]

This proves (12) in Case 2c1.

Case 2c2. Suppose that $k + 1 > h + u - v + l$. Then we define a new image $F'$ by moving
the ones at $(k + 1, j), (k + 2, j), \ldots, (h + u, j)$ to $(l, j), (l + 1, j), \ldots, (l + h + u - k - 1, j)$;
that is,

\[ F' = F \cup \{(l, j), (l + 1, j), \ldots, (l + h + u - k - 1, j)\} \setminus \{(k + 1, j), (k + 2, j), \ldots, (h + u, j)\}. \]

We define $r'_i$, $d'_i$, $\sigma'$, $A'_1$, $A'_2$ and $L_h(F')$ similarly as in Case 1. As in Case 2c1 we have
$L_h(F') = L_h(F)$ and $\sigma' \leq \sigma - 1$. Since $l + h + u - k - 1 < v$, of the $d_i$ with $i \in
\{i_1, i_2, \ldots, i_{2t+1}\}$ exactly one has changed: $d'_l = d_l - 1$. As $d_l$ has coefficient $+1$ in $A_1$,
we have $A'_1 = A_1 - 1$.

Now we consider $A'_2$. Some of the $d_i$ with $i \in \{\tilde{\imath}_1, \tilde{\imath}_2, \ldots, \tilde{\imath}_{2s+1}\}$ may have increased by 1.
If $\tilde{\imath}_1 > h + u$, none of the row indices $k + 1, k + 2, \ldots, h + u$ occurs in $\{\tilde{\imath}_1, \tilde{\imath}_2, \ldots, \tilde{\imath}_{2s+1}\}$,
and we have $A'_2 = A_2$. If not, then $k + 1 \leq \tilde{\imath}_1 \leq h + u$ (using (11)). The values of $i$ for
which $d'_i = d_i + 1$ are all consecutive. Since the coefficients for $d_i$ in $A_2$ are alternatingly
positive and negative, and since $\tilde{\imath}_1$ (which has a positive coefficient in $A_2$) is included in
$\{k + 1, k + 2, \ldots, h + u\}$, we have $A'_2 \geq A_2$.

By applying the induction hypothesis to $F'$, we find

\[ L_h(F) = L_h(F') \]

\[ \geq 2n + A'_1 + A'_2 - \sigma' \]

\[ \geq 2n + (A_1 - 1) + A_2 - (\sigma - 1) \]

\[ = 2n + A_1 + A_2 - \sigma. \]

This proves (12) in Case 2c2, which completes the proof of Case 2.



Case 3. Suppose that the conditions of Case 1 and Case 2 do not hold. By definition of $j$
we know that in column $j$ there is at least one one in rows $k+1, k+2, \ldots, m$. As Case
2 does not apply, we have $(k, j) \notin F$ or $(k+1, j) \notin F$. If $(k, j) \in F$ (so $(k+1, j) \notin F$) we
can apply Case 1: let $l$ be the smallest integer such that $(l, j) \notin F$, let $h'$ be the greatest
integer such that $(h', j) \in F$, and let $u$ be maximal such that $(i, j) \notin F$ for $l \le i \le l + u$
and $(i, j) \in F$ for $h' - u \le i \le h'$. Define $h = h' - u$. Since $(k, j) \in F$ and $(k+1, j) \notin F$,
we have $l + u < k$ and $h > k + 1$, so all conditions of Case 1 are satisfied.

Figure 8: Illustrations for Case 3 of the proof. The grey cells have value 1, the other cells value
0. (a) An example of column $j$ in Case 3. (b) Moving the ones in Case 3a. (c) Moving the ones in
Case 3b.



Hence we have $(k, j) \notin F$. Now there exist integers $h \ge k + 1$ and $u \ge 0$ such that
$h + u \le m - 1$ and

    $(h-1, j) \notin F$, and
    $(i, j) \in F$ for $h \le i \le h + u$, and
    $(h+u+1, j) \notin F$.

Furthermore, let $l \le k$ be such that $(l-1, j) \in F$ and $(l, j) \notin F$. Since Case 1 does not
apply, there does not exist an integer $u'$ such that $l + u' < k$, $(i, j) \notin F$ for $l \le i \le l + u'$
and $(l+u'+1, j) \in F$. This means that $(i, j) \notin F$ for all $i$ with $l \le i < k + 1$. Also, we
could still apply Case 1 if there were at least as many zeroes in $(l, j), (l+1, j), \ldots, (k, j)$ as
there are ones in $(h, j), (h+1, j), \ldots, (h+u, j)$. Hence we must have $u + 1 > k - l + 1$.
We will distinguish between various cases.



Case 3a. Suppose that either $i_{2t+1} < l$ or $i_{2t+1} = m$. This means that none of the $d_i$ with
$l \le i \le k$ has coefficient $+2$ in $A_1$. Since $u + 1 > k - l + 1$, we have $h + k - l < h + u$, so
there are ones at $(h, j), (h+1, j), \ldots, (h+k-l, j)$. We define a new image $F'$ by moving
those ones to $(l, j), (l+1, j), \ldots, (k, j)$; that is,

\[
F' = F \cup \{(l,j), (l+1,j), \ldots, (k,j)\} \setminus \{(h,j), (h+1,j), \ldots, (h+k-l,j)\}.
\]

We define $r'_i$, $d'_i$, $\sigma'$, $A'_1$, $A'_2$ and $L_h(F')$ similarly as in Case 1. As in Case 1 we have
$A'_2 \ge A_2 - 1$. Furthermore, $L_h(F') = L_h(F)$.

Suppose $l = k$. Then only one $d_i$ with $i \in \{1, 2, \ldots, k-1, k, m\}$ has changed, namely
$d'_k = d_k - 1$. We know that $d_k$ does not have a positive coefficient in $A_1$, since $k \ne i_{2t+1}$
(see (10)) and $i_{2t-1} \le k - 1$. So $A'_1 \ge A_1$. Also, $\sigma' = \sigma - 1$, so by applying the induction
hypothesis to $F'$, we find

\[
\begin{aligned}
L_h(F) = L_h(F') &\ge 2n + A'_1 + A'_2 - \sigma' \\
&\ge 2n + A_1 + (A_2 - 1) - (\sigma - 1) \\
&= 2n + A_1 + A_2 - \sigma.
\end{aligned}
\]



Now suppose that $l < k$. Then we have $\sigma' \le \sigma - 2$. Furthermore, none of the $d_i$ with
$l \le i \le k$ has coefficient $+2$ in $A_1$, so $A'_1 \ge A_1 - 1$. By applying the induction hypothesis
to $F'$, we find

\[
\begin{aligned}
L_h(F) = L_h(F') &\ge 2n + A'_1 + A'_2 - \sigma' \\
&\ge 2n + (A_1 - 1) + (A_2 - 1) - (\sigma - 2) \\
&= 2n + A_1 + A_2 - \sigma.
\end{aligned}
\]

This proves (12) in Case 3a.
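The bookkeeping behind such a move can be checked on a small example. The following is a minimal sketch, assuming an image is stored as a set of 1-based (row, column) cells; the helper names (`row_sums`, `col_sums`, `d_values`, `move_ones`) and the toy image are hypothetical and not taken from the paper. It only illustrates how the row sums, the $d_i$ and $\sigma$ change when ones are moved within one column (here in the situation $l = k$); it does not compute $L_h$ and does not check the other hypotheses of the theorem.

```python
def row_sums(F, m):
    """Row sums r_1..r_m of an image F given as a set of 1-based (row, col) cells."""
    return [sum(1 for (i, _) in F if i == r) for r in range(1, m + 1)]


def col_sums(F, n):
    """Column sums c_1..c_n of the image F."""
    return [sum(1 for (_, j) in F if j == c) for c in range(1, n + 1)]


def d_values(F, m, n):
    """d_i = b_i - r_i with b_i = #{j : c_j >= i}; entry i-1 of the list holds d_i."""
    c, r = col_sums(F, n), row_sums(F, m)
    return [sum(1 for cj in c if cj >= i) - r[i - 1] for i in range(1, m + 1)]


def move_ones(F, j, sources, targets):
    """Move the ones of column j from the rows in `sources` to the rows in `targets`."""
    assert len(sources) == len(targets)
    assert all((i, j) in F for i in sources)
    assert all((i, j) not in F for i in targets)
    return (F - {(i, j) for i in sources}) | {(i, j) for i in targets}


# Toy image (an assumption for illustration only): m = 5 rows, n = 2 columns, k = 2.
# Column j = 1 has a zero in row l = k = 2 and ones in rows h = 3 and h + u = 4.
m, n, k, j = 5, 2, 2, 1
F = {(1, 1), (3, 1), (4, 1), (1, 2), (2, 2)}

# Move the one at (h, j) = (3, 1) to (l, j) = (2, 1).
F_new = move_ones(F, j, sources=[3], targets=[2])

d_before, d_after = d_values(F, m, n), d_values(F_new, m, n)
sigma_before, sigma_after = sum(d_before[:k]), sum(d_after[:k])

print("column sums unchanged:", col_sums(F, n) == col_sums(F_new, n))
print("d before:", d_before)
print("d after: ", d_after)
print("sigma before/after:", sigma_before, sigma_after)
```

In this toy case the row that gains a one has its $d_i$ lowered by 1, the row that loses a one has its $d_i$ raised by 1, and $\sigma$ drops by 1 because exactly one cell in rows $1, \ldots, k$ was filled.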



Case 3b. Suppose that $i_{2t+1} \ge l$, $i_{2t+1} \ne m$ and $i_{2t+1} \ne k - 1$. Using (10), we then have
$l \le i_{2t+1} \le k - 2$. Since $u + 1 > k - l + 1$, we find that $u \ge k - l + 1 \ge (l + 2) - l + 1 = 3$.
We define a new image $F'$ by moving the ones at $(h, j)$, $(h+1, j)$ and $(h+2, j)$ to $(l, j)$,
$(l+1, j)$ and $(l+2, j)$; that is,

\[
F' = F \cup \{(l,j), (l+1,j), (l+2,j)\} \setminus \{(h,j), (h+1,j), (h+2,j)\}.
\]

We define $r'_i$, $d'_i$, $\sigma'$, $A'_1$, $A'_2$ and $L_h(F')$ similarly as in Case 1. As in Case 1, we have
$A'_1 \ge A_1 - 2$ and $A'_2 \ge A_2 - 1$. Furthermore, $L_h(F') = L_h(F)$ and $\sigma' = \sigma - 3$. By applying
the induction hypothesis to $F'$, we find

\[
\begin{aligned}
L_h(F) = L_h(F') &\ge 2n + A'_1 + A'_2 - \sigma' \\
&\ge 2n + (A_1 - 2) + (A_2 - 1) - (\sigma - 3) \\
&= 2n + A_1 + A_2 - \sigma.
\end{aligned}
\]

This proves (12) in Case 3b.



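Case 3c. Suppose that neither Case 3a nor Case 3b applies. Then we have $i_{2t+1} = k - 1$.
Using (11), this means that $\bar{\imath}_1 \ge k + 1 > k - 1 = i_{2t+1}$. We now apply Theorem 1 to the
image $F$ and the row indices $\{i_1, i_2, \ldots, i_{2t}, k-1, k, \bar{\imath}_1, \bar{\imath}_2, \ldots, \bar{\imath}_{2s+1}\}$:

\[
\begin{aligned}
L_h(F) &\ge 2n + d_{i_1} - d_{i_2} + \cdots - d_{i_{2t}} + d_{k-1} - d_k
          + d_{\bar{\imath}_1} - d_{\bar{\imath}_2} + \cdots - d_{\bar{\imath}_{2s}} + 2 d_{\bar{\imath}_{2s+1}} \\
&= 2n + A_1 - d_{k-1} - d_k + A_2.
\end{aligned}
\]

By Ryser's Theorem [10] we have $\sum_{i=1}^{k-2} d_i \ge 0$, since the line sums are consistent, so

\[
\sigma = \sum_{i=1}^{k} d_i = \sum_{i=1}^{k-2} d_i + d_{k-1} + d_k \ge d_{k-1} + d_k.
\]

Hence

\[
L_h(F) \ge 2n + A_1 - d_{k-1} - d_k + A_2 \ge 2n + A_1 + A_2 - \sigma,
\]

which proves (12) in Case 3c.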



This finishes the proof of the theorem. □ 

Example 5. Let $m = n = 12$ and let row sums (12, 8, 9, 8, 8, 5, 5, 2, 3, 2, 1, 0) and column
sums (10, 8, 8, 8, 6, 6, 6, 3, 2, 2, 2, 2) be given. We compute $b_i$ and $d_i$, $i = 1, 2, \ldots, 12$, as shown
below.

   i  :   1   2   3   4   5   6   7   8   9  10  11  12
  b_i :  12  12   8   7   7   7   4   4   1   1   0   0
  r_i :  12   8   9   8   8   5   5   2   3   2   1   0
  d_i :   0  +4  -1  -1  -1  +2  -1  +2  -2  -1  -1   0

Here ^ yields at most 

\[ L_h \ge 24 + 4 - (-1) + 2 - (-1) + 2 \cdot 2 = 36, \]

and ^ yields at most 

\[ L_h \ge 24 - (-2) + 2 - (-1) + 2 - (-1) + 4 - 2 \cdot 0 = 36. \]

Figure 9: The binary image from Example 5. The grey cells have value 1, the other cells value
0. The numbers indicate the row and column sums. The length of the horizontal boundary of this
image is 38.

However, we can apply Theorem 2 with $k = 5$ (note that $d_5 < 0$ and $d_6 > 0$). We have
$\sigma = 1$. If we take $t = 0$, $s = 1$, $i_1 = 2$, $\bar{\imath}_1 = 6$, $\bar{\imath}_2 = 7$ and $\bar{\imath}_3 = 8$, then we find

\[ L_h \ge 24 + 2 \cdot 4 + 2 - (-1) + 2 \cdot 2 - 1 = 38. \]

So in this example, Theorem 2 gives a better bound than Theorem 1. In fact, the bound
of Theorem 2 is sharp in this example: in Figure 9 a binary image $F$ with the given row
and column sums is shown, for which $L_h = 38$.
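The arithmetic of this example can be reproduced with a few lines of code. The sketch below is an illustration only: the function names (`conjugate_counts`, `alternating`) are hypothetical, and the index choice (2, 5, 6, 7, 8) used for the Theorem 1 bound is an assumption, namely one choice that matches the displayed sum $4 - (-1) + 2 - (-1) + 2 \cdot 2$.

```python
def conjugate_counts(col_sums, m):
    """b_i = #{j : c_j >= i} for i = 1..m (entry i-1 of the list holds b_i)."""
    return [sum(1 for c in col_sums if c >= i) for i in range(1, m + 1)]


def alternating(d, idx):
    """d_{i_1} - d_{i_2} + ... - d_{i_2t} + 2*d_{i_2t+1} for 1-based indices idx."""
    assert len(idx) % 2 == 1, "an odd number of indices is required"
    total = 0
    for pos, i in enumerate(idx):
        if pos == len(idx) - 1:
            coeff = 2                       # the last index carries coefficient +2
        else:
            coeff = 1 if pos % 2 == 0 else -1
        total += coeff * d[i - 1]
    return total


rows = [12, 8, 9, 8, 8, 5, 5, 2, 3, 2, 1, 0]
cols = [10, 8, 8, 8, 6, 6, 6, 3, 2, 2, 2, 2]
m, n = len(rows), len(cols)

b = conjugate_counts(cols, m)
d = [bi - ri for bi, ri in zip(b, rows)]    # d[i-1] = d_i

k = 5
sigma = sum(d[:k])                          # sigma = d_1 + ... + d_k

# Bound in the form used for Theorem 1, with the assumed indices (2, 5, 6, 7, 8).
bound_thm1 = 2 * n + alternating(d, [2, 5, 6, 7, 8])

# Bound of Theorem 2 with t = 0, s = 1, i_1 = 2 and barred indices 6, 7, 8.
bound_thm2 = 2 * n + alternating(d, [2]) + alternating(d, [6, 7, 8]) - sigma

print("b     =", b)
print("d     =", d)
print("sigma =", sigma)
print("Theorem 1 bound:", bound_thm1)
print("Theorem 2 bound:", bound_thm2)
```

With the data of Example 5 this gives $\sigma = 1$, a Theorem 1 bound of 36 and a Theorem 2 bound of 38, in agreement with the values above.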

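Corollary 2. Let row sums $\mathcal{R} = (r_1, r_2, \ldots, r_m)$ and column sums $\mathcal{C} = (c_1, c_2, \ldots, c_n)$
be given. Suppose there exists an image $F$ with line sums $(\mathcal{R}, \mathcal{C})$ and let $L_h(F)$ be the
total length of the horizontal boundary of this image. Define $b_i = \#\{j : c_j \ge i\}$ and
$d_i = b_i - r_i$ for $i = 1, 2, \ldots, m$. Also set $d_0 = d_{m+1} = 0$. Let $k$ be an integer with
$1 \le k < m$ such that $d_k < 0$ and $d_{k+1} > 0$. Let $\sigma = \sum_{i=1}^{k} d_i$. For any integers $t, s \ge 0$ and
any sets $\{i_1, i_2, \ldots, i_{2t+1}\} \subset \{0, 1, \ldots, k-1, k, m+1\}$ with $i_1 < i_2 < \cdots < i_{2t+1}$ and
$\{\bar{\imath}_1, \bar{\imath}_2, \ldots, \bar{\imath}_{2s+1}\} \subset \{0, k+1, k+2, \ldots, m, m+1\}$ with $\bar{\imath}_1 < \bar{\imath}_2 < \cdots < \bar{\imath}_{2s+1}$ we have

\[
\begin{aligned}
L_h(F) \ge 2n &+ d_{i_1} - d_{i_2} + d_{i_3} - \cdots - d_{i_{2t}} + 2 d_{i_{2t+1}} \\
&+ d_{\bar{\imath}_1} - d_{\bar{\imath}_2} + d_{\bar{\imath}_3} - \cdots - d_{\bar{\imath}_{2s}} + 2 d_{\bar{\imath}_{2s+1}} - \sigma. \qquad (13)
\end{aligned}
\]

Proof. Completely analogous to the proof of Corollary 1. □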
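Since (13) holds for every admissible choice of index sets, the strongest bound it offers for a fixed $k$ can be found by exhaustive search when $m$ is small. The following is a minimal sketch under that assumption; the function names (`padded_d`, `alt_sum`, `best_bound_13`) are hypothetical, and the brute-force search is an illustration rather than a method proposed in the paper.

```python
from itertools import combinations


def padded_d(row_sums, col_sums):
    """Return [d_0, d_1, ..., d_m, d_{m+1}] with d_0 = d_{m+1} = 0."""
    m = len(row_sums)
    b = [sum(1 for c in col_sums if c >= i) for i in range(1, m + 1)]
    return [0] + [bi - ri for bi, ri in zip(b, row_sums)] + [0]


def alt_sum(d, idx):
    """d_{i_1} - d_{i_2} + ... - d_{i_2t} + 2*d_{i_2t+1} for an increasing tuple idx."""
    last = len(idx) - 1
    return sum((2 if p == last else (1 if p % 2 == 0 else -1)) * d[i]
               for p, i in enumerate(idx))


def best_bound_13(row_sums, col_sums, k):
    """Largest right-hand side of (13) over all admissible index sets, for a given k."""
    m, n = len(row_sums), len(col_sums)
    d = padded_d(row_sums, col_sums)
    if not (1 <= k < m and d[k] < 0 and d[k + 1] > 0):
        raise ValueError("k must satisfy 1 <= k < m, d_k < 0 and d_{k+1} > 0")
    sigma = sum(d[1:k + 1])
    low_pool = list(range(0, k + 1)) + [m + 1]      # {0, 1, ..., k, m+1}
    high_pool = [0] + list(range(k + 1, m + 2))     # {0, k+1, ..., m, m+1}

    def best(pool):
        # combinations() of a sorted pool yields increasing tuples; odd sizes only.
        return max(alt_sum(d, idx)
                   for size in range(1, len(pool) + 1, 2)
                   for idx in combinations(pool, size))

    return 2 * n + best(low_pool) + best(high_pool) - sigma


# Data of Example 5, used here only as a usage example.
rows = [12, 8, 9, 8, 8, 5, 5, 2, 3, 2, 1, 0]
cols = [10, 8, 8, 8, 6, 6, 6, 3, 2, 2, 2, 2]
print(best_bound_13(rows, cols, k=5))
```

The search is exponential in $m$, which is acceptable for small instances such as Example 5 but not for large images; for those a dynamic programme over the alternating signs would be the natural replacement.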
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